Measuring Risk and Information Preservation: Toward New Metrics for De-identification of Clinical Texts
نویسندگان
چکیده
Current metrics for de-identification are based on information extraction metrics, and do not address the real-world questions “how good are current systems”, and “how good do they need to be”. Metrics are needed that quantify both the risk of re-identification and information preservation. We review the challenges in de-identifying clinical texts and the current metrics for assessing clinical de-identification systems. We then introduce three areas to explore that can lead to metrics that quantify reidentification risk and information preservation.
منابع مشابه
I-11: Optimal Strategy Toward Fertility Preservation
Background There are several indications of human female gamete cryostorage including sub-fertile and fertile patients. But our focus will be in women at risk of losing their reproductive function caused by oncologycal treatments or premature ovarian failure that could benefit greatly from this practice. Fertile women may take advantage of this technology to electively delay childbearing. The o...
متن کاملنیاز اطلاعاتی و رفتار اطلاعیابی پرستاران: مروری بر مطالعات انجام شده در جهان
Purpose: This study was carried out to assess the information seeking behavior among nurses and to recognize their information needs through analysis of research papers published in this field in international journals. Methodology: Citation analysis was used. The population under study included the articles in the field of nurses’ information seeking behavior, which were indexed in databases ...
متن کاملReview of ranked-based and unranked-based metrics for determining the effectiveness of search engines
Purpose: Traditionally, there have many metrics for evaluating the search engine, nevertheless various researchers’ proposed new metrics in recent years. Aware of this new metrics is essential to conduct research on evaluation of the search engine field. So, the purpose of this study was to provide an analysis of important and new metrics for evaluating the search engines. Methodology: This is ...
متن کاملMeasuring Interlanguage: Native Language Identification with L1-influence Metrics
The task of native language (L1) identification suffers from a relative paucity of useful training corpora, and standard within-corpus evaluation is often problematic due to topic bias. In this paper, we introduce a method for L1 identification in second language (L2) texts that relies only on much more plentiful L1 data, rather than the L2 texts that are traditionally used for training. In par...
متن کاملMeasuring spatial - temporal of Yazd urban form using spatial metrics
Abstract Urban form can be affected by diverse factors in different times. Socio- economic, political and physical factors are among the main contributors. So, one of the most important challenges of urban planners is measuring and identifying urban development pattern in order to direct and strengthen it to sustainable pattern and right direction. The case study of the present paper is the ...
متن کامل